The Yueli Lingji team targets the weak spatial perception of existing vision-language-action models, which rely on 2D images and struggle in complex environments. Their proposed approach aims to improve a robot's ability to judge depth and position in 3D space.
Microsoft has open-sourced TRELLIS.2, an image-to-3D tool that quickly generates textured 3D models from a single image and outputs .glb files compatible with platforms such as Blender and Unity. The tool uses a 4B-parameter model and supports resolutions from 512³ to 1536³. On an NVIDIA H100 GPU, generating a 512³ model takes approximately 3 seconds.
Bambu Lab launches the 'Print You' 3D figurine generator, powered by the Tencent Hunyuan 3D 3.0 model. It lets users create high-quality, printable 3D models from uploaded images, lowering the barrier to customization for enthusiasts and beginners.
Meta AI's SAM3D generates textured 3D assets from single 2D photos, outperforming existing methods with physics-aware geometry and materials for AR/VR, robotics, and film.
Microsoft Trellis 2 AI: quickly converts images into high-quality 3D models with PBR textures.
A free AI photo editor that quickly achieves creative edits.
Advanced AI technology can instantly convert text and images into 3D models without the need for 3D modeling experience.
SAM 3D: an AI-driven tool that instantly converts 2D images into professional-grade 3D models.
ImrozeAslamMalik
LGM is an integrated image-to-3D workflow incorporating multi-view diffusion models, capable of generating high-quality 3D content from a single image.
MonsterMMORPG
The TRELLIS image-conditioned version is a large-scale 3D generation model capable of generating corresponding 3D models from input 2D images.
VAST-AI
TripoSG-scribble is an AI tool that rapidly generates 3D models from scribble images and text prompts. As a variant of TripoSG, it is suitable for creative design and rapid prototyping.
Stable-X
An improved version of TRELLIS that supports converting 2D images into 3D models, with special support for normal conditioning.
homebrewltd
AlphaSpace is an innovative approach designed to enhance the spatial reasoning capabilities of language models for robotic manipulation in 3D Cartesian space.
Menlo
AlphaSpace is an innovative method that enhances language models' spatial reasoning capabilities for robotic manipulation in 3D Cartesian space.
TrianC0de
TripoSR is a fast feed-forward 3D generation model developed collaboratively by Stability AI and Tripo AI, capable of rapidly generating 3D models from a single image.
zhang3z
dust3r is a deep learning model for generating 3D models from images, supporting multi-view 3D reconstruction.
IvanTang
ENEL is a model exploring the potential of encoder-free architecture in 3D large multimodal models.
stanfordmimi
A family of medical image processing models consisting of six large-scale, generalizable 2D/3D variational autoencoders capable of encoding medical images into compressed latent representations and achieving high-fidelity image reconstruction.
craftsman3d
CraftsMan is a high-fidelity mesh generation system based on native 3D generation and interactive geometry optimization, capable of generating high-quality 3D mesh models from a single image.
WizWhite
A LoRA model for generating paper miniature models, specializing in creating flat cardboard scenes and 3D paper objects with a vintage style.
facebook
VFusion3D is a large-scale feed-forward 3D generation model trained with limited 3D data and extensive synthetic multi-view data, representing the first work to explore scalable 3D generation/reconstruction models.
jadechoghari
VFusion3D is a large-scale feed-forward 3D generation model trained with limited 3D data and extensive synthetic multi-view data, exploring scalable 3D generation/reconstruction models.
dylanebert
LGM is a high-resolution 3D content creation pipeline integrating multi-view diffusion models, specifically designed for 3D machine learning courses.
naver
DUSt3R is a deep learning model for generating 3D geometric models from images, capable of easily handling geometric 3D vision tasks.
Yiwen-ntu
MeshAnything is an artist-grade mesh generation model based on autoregressive Transformers, capable of converting images or point clouds into high-quality 3D mesh models.
GoodBaiBai88
M3D is a 3D medical image analysis technology based on multimodal large language models, including the M3D-Data dataset, M3D-LaMed model, and M3D-Bench evaluation benchmark.
zxhezexin
OpenLRM is an open-source implementation of the LRM paper for generating 3D models from a single image.
OpenLRM is an open-source implementation of the LRM paper, capable of generating 3D models from a single image, with multiple versions of different scales.
Blender MCP VXAI is a powerful integration tool that allows users to control Blender through natural language to create and modify 3D models, animations, and scenes. It simplifies complex operations and supports real-time export to projects.
FreeCAD MCP is a plugin for controlling FreeCAD through Claude Desktop, supporting various design functions such as creating 3D models from 2D drawings.
An MCP server based on OpenSCAD that generates multi-view images through AI and reconstructs them into parametric 3D models, supporting remote CUDA-accelerated processing.
The OpenSCAD MCP Server is a tool for generating parametric 3D models through text or images, supporting multi-view reconstruction and remote processing.
The OpenSCAD MCP Server is a service for generating parametric 3D models from text or images. It supports multi-view reconstruction, AI image generation, remote CUDA processing, and workflow approval, and outputs OpenSCAD-compatible model files.
Trellis MCP is an interface service that connects AI assistants with Trellis 3D generation models, supporting rapid generation of 3D assets through natural language and importing them into Blender. This project is based on an open-source model and requires self-deployment of the API backend. It is fast and free, but there are stability risks.
The game asset generator uses AI models and MCP to quickly generate 2D and 3D game assets from text prompts.
An open-source project that integrates Blender with local AI models to control 3D modeling through natural language.
The MCP STL 3D Relief Generator is a tool that converts 2D images into 3D relief models, supporting functions such as controlling model size, adding a base, and depth inversion. It is suitable for 3D printing and rendering.
The Meshy AI MCP Server is a Model Context Protocol server for interacting with the Meshy AI API, providing functions such as generating 3D models from text and images, applying textures, and remeshing models.
An MCP server for processing, validating, optimizing, and analyzing 3D models (supporting glTF/GLB formats), providing functions such as model analysis, format conversion, compression, and texture optimization.
An MCP server for interacting with the Sketchfab 3D model platform, supporting functions such as searching, viewing details, and downloading 3D models.